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Statistical Beamforming on the Grassmann 
Manifold for the Two-User Broadcast Channel 

Vasanthan Raghavan*, Stephen V. Hanly, Venugopal V. Veeravalli 



Abstract 



A Rayleigh fading spatially correlated broadcast setting with M ^ 2 antennas at the transmitter and 
two-users (each with a single antenna) is considered. It is assumed that the users have perfect channel 
information about their links whereas the transmitter has only statistical information of each user's link 
(covariance matrix of the vector channel). A low-complexity linear beamforming strategy that allocates 
equal power and one spatial eigen-mode to each user is employed at the transmitter Beamforming 
vectors on the Grassmann manifold that depend only on statistical information are to be designed at 
the transmitter to maximize the ergodic sum-rate delivered to the two users. Towards this goal, the 
fS| . beamforming vectors are first fixed and a closed-form expression is obtained for the ergodic sum-rate in 

■ terms of the covariance matrices of the links. This expression is non-convex in the beamforming vectors 

ensuring that the classical Lagrange multiplier technique is not applicable. Despite this difficulty, the 
optimal solution to this problem is shown to be the solution to the maximization of an appropriately - 
defined average signal-to-interference and noise ratio (SINR) metric for each user This solution is the 



5-H ■ dominant generalized eigenvector of a pair of positive-definite matrices where the first matrix is the 

covariance matrix of the forward link and the second is an appropriately-designed "effective" interference 
covariance matrix. In this sense, our work is a generalization of optimal signalling along the dominant 
eigen-mode of the transmit covariance matrix in the single-user case. Finally, the ergodic sum-rate 



V. Raghavan is with the Department of Mathematics, University of Southern California, Los Angeles, CA 90089, USA. He 
was with the Coordinated Science Laboratory, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA and the 
Department of Electrical and Electronic Engineering, The University of Melbourne, Parkville, VIC 3052, Australia when parts of 
this work was done. S. V. Hanly is with the Department of Electrical and Computer Engineering, National University of Singapore, 
Singapore 117576. V. V. Veeravalli is with the Coordinated Science Laboratory and the Department of Electrical and Computer 
Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Email: vasanthan_raghavan@ieee.org, 
elehsv@nus.edu.sg, vvv@illinois.edu. * Corresponding author. 

This work has been supported in part by the ARC through grant DP-0984862, the NUS through grant WBS #R-263-000- 
572-133 and the NSF through grant CNS-0831670 at the University of Illinois. This paper was presented in part at the IEEE 
International Symposium on Information Theory, Austin, TX, 2010. 



SUBMITTED TO THE IEEE TRANSACTIONS ON INFORMATION THEORY, APRIL 2011 



2 



for the general broadcast setting with M antennas at the transmitter and A/-users (each with a single 
antenna) is obtained in terms of the covariance matrices of the links and the beamforming vectors. 

Index Terms 

Adaptive signalling, broadcast channel, information rates, MISO systems, multi-user MIMO, precoding, 
spatial correlation. 

I. Introduction 

The last fifteen years of research in wireless communications has seen the emergence of 
multi-antenna signalling as a viable option to realize high data-rates at practically acceptable 
reliability levels. While initial work on multi-antenna design was primarily motivated by the 
single-user paradigm [l]-[5], more recent attention has been on the theory and practice of multi- 
user multi-antenna communications [6]-[9]. The focus of this paper is on a broadcast setting 
that typically models a cellular downlink. We study the multiple-input single-output (MISO) 
broadcast problem where a central transmitter with M antennas communicates with M users in 
the cell, each having a single antenna. Under the assumption of perfect channel state information 
(CSI) at both the transmitter and the user ends, significant progress has been made over the last 
few years on understanding optimal signalling that achieves the sum-capacity [10]-[15] as well 
as the capacity region [16] of the multi-antenna broadcast channel. The capacity-achieving dirty- 
paper coding scheme [17] pre-nuUs interference from simultaneous transmissions by other users 
to a specific user and hence results in a multiplexing gain of M. 

Nevertheless, the high implementation complexity associated with dirty-paper coding [18] 
makes it less attractive in standardization efforts for practical systems. The consequent search 
for low-complexity signalling alternatives that are within a fixed power-offse10 of the dirty-paper 
coding scheme has resulted in an array of candidate linear (as well as non-linear) precoding 
techniques [19]-[27]. In particular, a linear beamforming scheme that is developed as a general- 
ization of the single-user beamforming scheme has attracted significant attention in the literature. 
Specifically, a scheme where the transmitter allocates one eigen-mode to each user and shares 
the power budget equally among all the users is the focus of this work. 

If perfect CSI is available at both the ends, instantaneous nulls can be created in the interference 
sub-space of each user (or interference can be zeroforced) and thus this scheme remains order- 
optimal with respect to the dirty -paper coding scheme [23]. However, the practical utility of the 
linear beamforming scheme is dependent on how gracefully its performance degrades with the 

'Two schemes are within a fixed power-offset if the difference in power level necessary to achieve a fixed rate with the two 
schemes stays bounded independent of the rate. 



SUBMITTED TO THE IEEE TRANSACTIONS ON INFORMATION THEORY, APRIL 2011 



3 



quality of CSI at the transmitter. This is because while reasonably accurate CSI can be obtained 
at the user end via pilot-based training schemes, CSI at the transmitter requires either channel 
reciprocity or reverse link feedback, both of which put an overwhelming burden on the operating 
cost [9]. In the extreme (and pessimistic) setting of no CSI at the transmitter, the multiplexing 
gain reduces to 1 (that is, it is lost completely relative to the perfect CSI case). 

In practice, the channel evolves fairly slowly on a statistical scale and it is possible to learn 
the spatial statistics^ of the individual links at the transmitter with minimal cost. With only 
statistical information at the transmitter, the interference cannot be nulled out completely and 
a low-complexity decoder architecture that treats interference as noise is often preferred. Initial 
works assume an identity covariance matrix for all the users corresponding to an independent 
and identically distributed (i.i.d.) fading process in the spatial domain [6]-[9]. However, this 
model cannot be justified in practical systems that are often deployed in environments where the 
scattering is localized in certain spatial directions or where antennas are not spaced wide apart 
due to infrastructural constraints [28]. 

While signalling design for the single-user setting under a very general spatial correlation 
model is now well-understood [l]-[5], the broadcast case where the channel statistics vary 
across users and different users experience different covariance matrices has not received much 
attention. In particular, [29] studies the problem where all the users share a common non-i.i.d. 
transmit covariance matrix and captures the impact of this common covariance matrix on the 
achievable rates. In [30], the authors show that second-order spatial statistics can be exploited 
to schedule users that enjoy better channel quality and hence improve the overall performance 
of an opportunistic beamforming scheme. In the same spirit, it is shown in [31] and [32] that 
second-order moments of the channel in combination with instantaneous norm (or weighted- 
norm) feedback is sufficient to extract almost all of the multi-user diversity gain in a broadcast 
setting. Spatial correlation is exploited to reduce the feedback overhead of a limited feedback 
codebook design in [33]-[36]. 

Summary of Main Contributions: With this background, the main focus of this paper is to fill 
some of the gaps in understanding the information-theoretic limits of broadcast channels with 
low-complexity signalling schemes (such as linear beamforming) under practical assumptions 
on CSI and decoder architecture. We study the simplest non-trivial version of this problem 
corresponding to the two-user (M = 2) case. We design optimal beamforming vectors on 
the Grassmann manifolcl^ Q{2, 1) to maximize the ergodic sum-rate achievable with the linear 

^With a Rayleigh (or a Ricean) fading model for the MISO channel, the complete statistical information of the link is captured 
by the covariance matrix (or the mean vector and the covariance matrix) of the vector channel. 

^Informally, G{M, 1) denotes the space of all A/-dimensional unit-norm beamforming vectors modulo the phase of the first 
element of the vector. A more formal definition is provided in Sec. HH (Def. [ll. 
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beamforming scheme. 

The first step to this goal is the computation of the ergodic sum-rate in closed-form. For 
this, we develop insight into the structure of the density function of the weighted-norm of 
beamforming vectors isotropically distributed on Q(2,l). Exploiting this knowledge, we derive 
an explicit expression for the ergodic sum-rate in terms of the covariance matrices of the users 
and the beamforming vectors. This expression can be rewritten in terms of a certain generalized 
"distance" measure between the beamforming vectors. As a result of this complicated non-linear 
dependence, the sum-rate is non-convex in the beamforming vectors thus precluding the use of the 
classical Lagrangian approach to convex optimization. Instead, a first-principles based technique 
is developed where the beamforming vectors are decomposed along an appropriately chosen 
(in general, non-orthogonal) basis. Exploiting this decomposition structure, we obtain an upper 
bound for the ergodic sum-rate, which we show is tight for a specific choice of beamforming 
vectors (see Theorems |2] and |3]). This optimal choice is the dominant generalized eigenvectoi0 of a 
pair of covariance matrices, with one of them being the covariance matrix of the forward link and 
the other an appropriately-designed "effective" interference covariance matrix. The generalized 
eigenvector structure is the solution to maximizing an appropriately-defined average signal-to- 
interference and noise ratio (SINR) metric for each user and thus generalizes our intuition from 
the single-user case [l]-[5]. Table I in the Conclusions section (SeclVTl) summarizes the structure 
of the optimal beamforming vectors under different signal-to-noise ratio (SNR) assumptions. 

While a generalized eigenvector solution has been obtained in the perfect CSI case for the 
multiple-input multiple-output (MIMO) broadcast problem [26], [37] and the MIMO interference 
channel problem in the low-interference regime [38], to the best of our knowledge, its appearance 
in the statistical setting is a first. A closely-related work of ours [39] reports the optimality of the 
generalized eigenvector solution for the statistical beamformer design in the MISO interference 
channel setting with two antennas. We also extend our intuition to the weighted ergodic sum- 
rate maximization problem [40] and conjecture on the structure of the optimal beamforming 
vectors. Numerical results justify our conjecture and the intuition behind it. Finally, closed-form 
expression for the ergodic sum-rate in terms of the covariance matrices of the links and the 
beamforming vectors are obtained in the general M-user case. 

Organization: This paper is organized as follows. With Section |ll] explaining the background of 
the problem, ergodic rate expressions in terms of the covariance matrices of the links and beam- 
forming vectors are obtained in Section |IlI] for the M = 2 case. The non-convex optimization 
problem of ergodic sum-rate maximization is the main focus of Section |IV] with the low- and 

''a generalized eigenvector generalizes the notion of an eigenvector to a pair of matrices. A more teclmical definition is 
provided in Sec. lIVK Def. l2t. The dominant eigenvector is the eigenvector corresponding to the dominant eigenvalue. Under the 
assumption that the eigenvector is unit-norm, it is unique on Q{M, 1). 
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the high-SNR extremes providing insight for the development in the intermediate-SNR regime. 
The focus shifts to weighted ergodic sum-rate maximization in Section |Vl In addition, sum-rate 
expressions are generalized to the general M-user case and concluding remarks are provided in 
Section |Vll Most of the proofs/details are relegated to the Appendices. 

Notation: We use upper- and lower-case bold symbols for matrices and vectors, respectively. 
The notations A and U are usually reserved for eigenvalue and eigenvector matrices whereas I 
is reserved for the identity matrix (of appropriate dimensionality). The z-th diagonal element of 
A is denoted by Aj while the i-th element of a vector x is denoted by x(i). At times, we also 
use Ai, A2, ■ ■ ■ to denote the eigenvalues of a Hermitian matrix, and these eigenvalues are often 
arranged in decreasing order as Ai > A2 > ■ • • . The Hermitian transpose and inverse operations 
of a matrix are denoted by (■)^ and (•)^^ while the trace operator is denoted by Tr(-). The 
two-norm of a vector is denoted by the symbol || ■ ||. The operator E[-] stands for expectation 
while the density function of a random variable is denoted by the symbol p(-). The symbols C 
and stand for complex and positive real fields, respectively. X ~ CJ\f{fi, cr^) indicates that 
X is a complex Gaussian random variable with mean fi and variance cr^. 



We consider a broadcast setting that models a MISO cellular downlink with M antennas 
at the transmitter and M users, each with a single antenna. We denote the M x 1 vector 
channel between the transmitter and user i as hi, i = 1, ■ ■ ■ , M. While different multi-user 
communication strategies can be considered [19]-[27], as motivated in Sec. H the focus here 
is on a linear beamforming scheme where the information-bearing signal Si meant for user i 
is beamformed from the transmitter with the M x 1 unit-norm vector Wj. We assume that Sj 
is unit energy and the transmitter divides its power budgejf] of p equally across all the users. 
Equal power allocation is popular in current-generation cellular standards where low-complexity 
schemes are preferred. The received symbol yi at user i is written as 



where p is the transmit power and denotes the CJ\f{0, 1) complex Gaussian noise added at 
the receiver. 

Initial works on the broadcast problem assume that is ergodic and it evolves over time and 
frequency in an i.i.d. fashion, and is spatially i.i.d. While the above assumption can be justified in 
the time and frequency axes with a frame-based and multi-carrier signalling approach (common 

^The practically motivated power-control problem where different powers could be allocated to the different users is a related 
problem, but it is not studied here. 



II. System Setup 




(1) 
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in current-generation systems), it cannot be justified along the spatial axis. This is because the 
channel variation in the spatial (antenna) domain cannot be i.i.d. unless the antennas at the 
transmitter end are spaced wide apart and the scattering environment connecting the transmitter 
with the users is sufficiently rich [28]. With this motivation, the main emphasis of this work 
is on understanding the impact of the users' spatial statistics on the performance of a linear 
beamforming scheme. 

We assume a Rayleigh fading§ (zero mean complex Gaussian) model for the channel, which 
implies that the complete spatial statistics are described by the second-order moments of {hj}. 
For the MISO model, the channel hj of user i can be written as 

hi = Sj/'hiid,j (2) 

where hiid.j is an M x 1 vector with i.i.d. CJ\f{0, 1) entries and "Si = E [hjhf^] is the transmit 
covariance matrix corresponding to user i. Note that ^ is the most general statistical model for 
hj under the MISO assumption. With Sj = I for all users, (|2]) reduces to the i.i.d. downlink 
model well-studied in the literature [6]-[9]. 

Under the assumption of Gaussian inputs {sj}, the instantaneous information-theoretic0 rate, 
Ri, achievable by user i with the linear beamforming scheme using a mismatcheci^ decoder [41] 
is given by 

/ ^ . |hf w,P \ 

S-^ ' ^ ' 

h. 1 li, 2 

With the spatial correlation model assumed in (|2]), we can write Jj i as 

ki = logfl + ^-h^^d.Sl/^fEw.wf^sl/^h,.,.) (5) 



= log(l + ^-h^^,^,V,A,Vfhiid,.), (6) 
where we have used the following eigen-decomposition in (|6]): 

V.A.Vf = Sl/^(^f;w,wfjEl/^ (7) 
Aj = diag([Aj,i, ■■■ , Aj^M]), A,,i > ■ ■ • > Aj,M > 0. (8) 

'while more general fading models such as Ricean or Nakagami-m models can be considered, this paper focuses on the 
Rayleigh model alone. 

^All logarithms are to base e and all rate quantities are assumed to be in nats/s/Hz in this work. 

^Here, the decoding rule is different from the optimal decoding rule due to the presence of multi-user interference. 
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Similarly, we can write Jj 2 as 

Ii,2 = log(l + ^-h^^d,iV,A,Vfhnd,.) (9) 

V.A.Vf = ^/'(j2^,^f]^y' (10) 

Ai = diag([A,,i, ■■■ , Ai,M]), A,,i > • ■ • > Ai,M > 0. (11) 

The goal of this work is to maximize the throughput conveyed from the transmitter to the 
users by the choice of beamforming vectors. Specifically, the metric of interest is the ergodic 
sum-rate, T^sum^ achievable with the linear beamforming scheme: 

M 

7^sum =5^^[i?*]. (12) 

i=l 

For this, note that the achievable rate in (|3]) is invariant to transformations of the form Wi t-^ e^^Wi 
for any 9. Coupled with the unit-norm assumption for Wi, the space over which optimization is 
performed is precisely defined as follows. 

Definition 1 (Stiefel and Grassmann Manifolds [42]): The uni-dimensional complex Stiefel 
manifold St(M, 1) refers to the unit-radius complex sphere in M-dimensions and is defined as 

St(M,l) = {x G C*^ : ||x|| = 1} . (13) 

The uni-dimensional complex Grassmann manifold Q{M, 1) consists of the set of one-dimensional 
subspaces of St(M, 1). Here, a transformation of the form x i-t- e^^x (for any 9) is treated as 
invariant by considering all vectors of the form e^^x (for some 9) to belong to the one-dimensional 
sub-space spanned by x. ■ 
The optimization objective is then to understand the structure of the beamforming vectors, 
{wi^opt}, that maximize T^sum: 

Wi,opt = arg max T^sum, i = 1, - ■■ ,M. (14) 

WiGg(A/,l) 

In (fT4l) . the candidate beamforming vectors, {wi}, depend only on the long-term statistics of 
the channel, which (as noted before) in the MISO setting is the set of all transmit covariance 
matrices, {Sj}. 

Towards the goal of computing T^sum, we decompose hnd j into its magnitude and directional 
components as hiid,i = ||hiid,i|| ■ hiid,i. It is well-known [1] that ||hiid,i|p can be written as 

^ 2M 

l|hiid,dP=2E^I (15) 
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where Xj is a standard (real) chi-squared random variable and h\\d,i is a unit-norm vector that 
is isotropically distributed [42], [43] on G{M, 1). Thus, we can rewrite Jj i and Jj 2 as 



k2 



log(l + ^-||hiid,.f ■h^^d,.V,A,Vfhi 



iid, i 



log 1 + 



_P_ 

M 



'■iid, i 



(16) 
(17) 



Since the magnitude and directional information of an i.i.d. random vector are independent [43], 
E 1] and E [Jj^ 2] can be further written as 



E[ki] 
E[k2] 



Ell 



Ell 



E- 



hiid, z 



P 



E, 



log 1 



_P_ 

M 



1 l|2 \ h 

^iid, ill ■ '■'■Wd i ■'■M -l^iid, j 



(18) 
(19) 



where we have also used the fact that a fixecO unitary transformation of an isotropically distributed 
vector on Q{M, 1) does not alter its distribution. 



III. Rate Characterization: Two-User Case 

We now restrict attention to the special case of two-users (M = 2) and focus on computing 
the ergodic information-theoretic rates given in (US) and ([T9l) in closed-form. The following 
theorem computes the ergodic rates as a function of the covariance matrices of the two links 
(5]i and S2), and the choice of beamforming vectors {wi and W2). 

Theorem 1: The ergodic information-theoretic rate achievable at user i (where i = 1,2) with 
linear beamforming in the two-user case is given as 

' pAi, 2 



E [R,] = E [J,,i] - E [J,,2] = T- - h 

where h{») is a monotonically increasing function defined as 



h{x) = exp ( — j El (— ] , X G (0, 00) 



(20) 



(21) 



with Ei{x) = denoting the Exponential integral [44]. The corresponding eigenvalues 

(cf. ^ and (flOl) ) can be written in terms of Sj and the beamforming vectors as follows: 



Ai,i 

Ai,2 

Ai,i 



2 



Bi, 



(22) 

(23) 
(24) 



'Note that the unitary transformation is independent of the channel realization when the beamforming vectors are dependent 
only on the long-term statistics of the channel. 
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where = wf^SjWj, Bi = wj^SjWj and Cj = |wf^SjWj| with j ^ i and {i,j} = 1, 2. 

Proof: Since E [Ri] = E [Jj. i] — E [Ii^2], we start by computing E [/j, i]. From (fTSl ). we 

have 

fAi, 1 



log ( 1 + ^ ■ xy)pi{y)dy 



(25) 



'2/=Ai,2 

where X stands for the random variable X = ||hiid,i|p, a; is a realization of X and Pi{y) denotes 
the density function of 

hHd,i^ihiid,i = Ai,j hiid,^(j) , (26) 
i=i 

evaluated at y with Aj 2 < < 1. That is, a closed-form computation of E [Jj 1] requires the 
density function of weighted-norm of vectors isotropically distributed on Q(2,l). In Lemma [T] 
of Appendix lAl we show that 

1 



Piiy) 



Ai,2 <y< Ai,i. 



A»,i - Ai,2 

Using this information along with the chi-squared structure of ||hiid,ip (see (fTSl)). we have 



(27) 



E[I, 



Ai,i - A,, 2 
Integrating out the y variable, we have 

1 



xe 



x=Q 



log ( 1 + f^xy ) dydx. 



(28) 



E[I, 



i,l 



(Ai,i - Ai,2) 

1 



f (A,,i-A 



i,2, 



{1 + ^Ai^ix) ■ log {l + l^i^ix) ■ e ''(ia; 
■ ^ (1 + |Ai,2x) • log (1 + ^A,„2x) ■ e-^rfx - 1. (29) 



Following a routine computation using the list of integral table formula [45, 4.337(2), p. 572], 
we have the expression for E 1] . Particularizing this expression to the case of E [Jj^ 2] in (fT9l ) 
with Aj 2 = results in the rate expression as in the statement of the theorem. To complete 
the proposition, an elementary computation of the eigenvalues of the associated 2x2 matrices 
in (|7]) and (flOl) results in their characterization. ■ 
The increasing nature of h{»), defined in (|2T)) . is illustrated in Fig. [Tl Towards the goal of 
obtaining physical intuition on the structure of the optimal beamforming vectors, it is of interest 
to obtain the limiting form of the ergodic rates in the low- and the high-SNR extremes. 



A. Low-SNR Extreme 

Proposition 1: The ergodic rate E [Ri] can be bounded as 



"^-^flA^rrfcySl + ^C^, (30) 
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Fig. 1. The behavior of h{x) for x satisfying < a; < 25. 



for some positive constants Cup and C|ow (not provided here for the sake of brevity) that depend 
only on the eigenvalues Aj^i, Aj^2 and Aj^i. Thus, as p -> 0, we have 

^^^'^ l(A,i + A,2-5.) (31) 



P 2 

A; W,^5],;Wi 



2 2 



(32) 



Proof: We need the following bounds on the Exponential integral [44, 5.1.20, p. 229]: 

^ < ^ log (1 + 2x) < h{x) < log (l + x) <x (33) 



1 + 2x - 2 

where the extremal inequalities are established by using the fact that 

X 



< log(l + x) <x. (34) 

X + I 

Using these bounds, it is straightforward to see that the relationship in (l30l) holds. Note that 
both the upper and lower bounds converge to the same value as p — )■ 0, which results in the 
simplification in (l32l) . ■ 
In the low-SNR extreme, the system is noise-limited, hence the linear scaling of E [i?,] with p. 
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B. High-SNR Extreme 

Proposition 2: As p — > oo, we have 



p^po Ai,ilog(A,,i) - Aj,2log(Aj,2) 



E[Ri\ log(5i) (35) 



log 



+ 2^°H^^J (36) 

with Aj, _Bi and Cj as in Theorem [T] 

Proof: The following asymptotic expansion [44, 5.1.11, p. 229] of the Exponential integral 
is useful in obtaining the limiting form of E [R.-] as p — )• oo: 

^ ^ k=l 



log J + a; - 7 (38) 

where 7 ^ 0.577 is the Euler-Mascheroni constant. Using the limiting value of Ei{x) to 
approximate E [i?,], we have the expression in (1351) . Expanding Aj 1 and Aj 2 in terms of Ai, Bi 
and Cj, we have the expression in (|36l ). ■ 
Unlike the low-SNR extreme, E [Ri] is not a function of p here. The dominating impact of 
interference (due to the fixed nature of the beamforming vectors that are not adapted to the 
channel realizations) and the consequent boundedness of E [Ri] in (l35l) as p increases should 
not be surprising. 



IV. Sum-Rate Optimization: Two-User Case 

We are now interested in understanding the structure of the optimal choice of beamforming 
vectors (wi,opt, W2,opt) that maximize T^sum as a function of Si, S2 and p. This problem is 
difficult, in general. To obtain insight, we first consider the low- and the high-SNR extremes 
before studying the intermediate-SNR regime. 

For simplicity, let us assume an eigen-decomposition for Si and S2 of the form 

51 = Udiag([Ai(Si), A2(Si)]) U^, (39) 

52 = Udiag([Ai(S2), A2(S2)]) U^, (40) 

where U = [ui(Si), U2(Si)], U = [ui(S2), U2(S2)], and Ai(S,) > A2(S,), 2 = 1,2. In 
particular, we assume that both Si and S2 are positive-definite, that is, A2(Si) > 0. 
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A. Low-SMR Extreme 

Proposition 3: In the low-SNR regime, from Prop. [T]we see that the maximization of E [Ri] 
involves optimizing over Wj alone. Thus, we have 

Wi,opt = arg maxT^sum = arg maxwf S^Wj = e^'''ui(Si) (41) 

for some choice of ut G [0, 2tt), i = 1,2. The resulting ergodic sum-rate satisfies 

TZ 1 



lim 

P^o p 2 



(42) 



In the low-SNR extreme, the optimal solution is such that the transmitter signals to a given 
user along the dominant statistical eigen-mode of that user's channel and ignores the other 
user's channel completely. This is a solution motivated by the single-user viewpoint where the 
optimality of signalling along the dominant statistical eigen-mode of the forward channel is well- 
known [l]-[5]. This solution is not surprising since in the noise-limited regime, the broadcast 
channel is well- approximated by separate single-user models connecting the transmitter to each 
receiver. 



B. High-SNR Extreme 

Define S (and its corresponding eigen-decomposition) as 

S 4 S^"^ Si S^"^ = V diag ([r/i r/2]) (43) 

where V = [vi V2] and r/i > 772- Note that S is positive-definite (772 > 0) since both Si and S2 
are positive-definite. The main result of this section is as follows. 

Theorem 2: In the high-SNR extreme, the ergodic sum-rate is maximized by the following 
choice of beamforming vectors: 

wi,opt = e^"' ■ , W2,opt = e^""' ■ (44) 

11^2 ^ "^lll 11^2 ^ "^211 

for some choice of Ui E [0, 2n), i = 1,2. The optimal ergodic sum-rate satisfies 

lim 7J.„ = ^iiHiM + M^2) (45) 

p->oo Ki — i K2 — i- 



where 



A '71T-2 A 7-2 

Ki = , K2 = —, (46) 

^?2n Tl 

ri=vfS2-'vi, r2 = V2^S2-'v2, r3 = vfS2-'v2. (47) 
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Proof: The first step in our proof is to rewrite the high-SNR rate expression in a form that 
permits further analysis. This is done in Appendices |B]and O With the definition of {vi, V2} 
as in (|43] ). since S2 is full-rank, we can decompose wi and W2 as 

||aS2 ' vi + /3S2 ' V2II 

II7E2 ' vi + 5S2 ' V2II 

for some choice of {a,/3,7,5} with a = \a\e^^" (similarly, for other quantities) satisfying 
+ = I7P + |5p = 1. In Appendix iDl we show that the ergodic sum-rate optimization 
over the six-dimensional parameter space {\a\, \'j\,9a,0i3,9^,9s} results in the choice as in the 
statement of the theorem. ■ 
Many remarks are in order at this stage. 
Remarks: 

1) Recall the definition of a generalized eigenvector: 

Definition 2 (Generalized eigenvector [46]): A generalized eigenvector x (with the cor- 
responding generalized eigenvalue o) of a pair of matrices (A, B) satisfies the relationship 

Ax = aBx. (50) 

In the special case where B is invertible, a generalized eigenvector of the pair (A, B) is 
also an eigenvector of B~^ A. If A and B are also positive-definite, then all the generalized 
eigenvalues are positive. While a unit-norm generalized eigenvector (or an eigenvector) is 
not unique on St(M, 1), it is unique on Q{M, 1). ■ 
We decompose {wi, W2} in (|48l) -(l49l) along the basiil^ {Sg ^ vi, S2 ^ V2} instead of the 
more routine basis |vi, V2}. The reason for this peculiar choice is as follows. It turns out 
that S2 ^ vi and S2 ^ V2 are the dominant generalized eigenvectors (corresponding to the 
largest generalized eigenvalue) of the pairs (Si, S2) and (S2, Si), respectively. For this 
claim, we use (l43l) to note that 



S2^Si = S2^ (S2^SiS2^)S| = MDM"^ (51) 

S^^S2 = (S^^Si)"^ = MD^^M"^ (52) 
where M = Sg V and D = diag([?7i 772]) . This means that we can write 

wi,opt = e^'"^-ui(S2^Si) (53) 

W2,opt = e^'^^-U2(S2'Si) (54) 

'"Note that S2 is a full-rank matrix and hence, the vectors ^ vi and Eg ^ ^2 form a non-orthogonal basis (in general), 
whereas {vi,V2} is orthonormal. 
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where ui(«) and U2(») are the dominant and non-dominant eigenvectors, respectively. 
Using the generalized eigenvector structure, it is easy to see that 

ui(S2-^Si) = U2(Sr'S2) (55) 

U2(S2"^Si) = ui(Sr'S2) (56) 

and thus 

wi,opt = e^"' ■ u,{^^'^,) (57) 

W2,opt = e-''^^-ui(S7iS2). (58) 

2) Given that the transmitter has only statistical information of the two links, a natural 
candidate for beamforming in the high-SNR extreme is the solution to the maximization of 
an appropriately-defined average SINR metric for each user. Motivated by the fact (see dS])) 
that the instantaneous sum-rate for the i-th user (Ri) is an increasing function of |hf^Wjp 
whereas Rj (for j 7^ i) is a decreasing function of |h|^Wip, we define an "average" SINR 
metric as follows: 

E[\hfwi\^\ wfSjW, 

The optimization problem of interest is to maximize SINRj which has the generalized 
eigenvector structure as solution [47]: 

arg max SINR, = e^"' ui (StIs,) , j ^ z, {i,j} = 1,2. (60) 

Wi : Tivfwi = l 

It follows that if user i selfishly maximizes (its own) SINRj metric, then the set of 
such beamforming vectors maximize the ergodic sum-rate in the high-SNR regime. In 
this sense, the solution to the broadcast problem mirrors and generalizes the single-user 
setting, where the optimality of signalling along the statistical eigen-modes of the channel 
is well-understood [l]-[5]. Further, while optimal beamformer solutions in terms of the 
generalized eigenvectors are obtained in the perfect CSI case of the broadcast setting for 
the beamforming design problem [26], [37] and the interference channel problem [38], to 
the best of our knowledge, this solution in the statistical case is a first. A similar result 
is obtained in a related work of ours [39] on statistical beamforming vector design for 
the interference channel case. Since the generalized eigenvector solution has an intuitive 
explanation, it is of interest to obtain useful insights on the optimality of this solution in 
more general multi-user settings. 

3) The ergodic sum-rate in (|45T ) is increasing in ki and thus in ^. We now observe that 
ill-conditioning of Si is necessary and sufficient to ensure that ^ is large. For this, we 
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use standard eigenvalue inequalities for product of Hermitian matrices [47] to see that 

XI _ Ai(Si) ■ A2(S,-^) ^ r/i ^ Ai(Si) ■ Ai(S, ^) _ 

X2 A2(Si) ■ Ai(S2-i) - - A2(Si) ■ X^i^^') 
where = ^^^^'^ , i = 1, 2. In other words, the more ill-conditioned Si is, the larger the 
high-SNR statistical beamforming sum-rate asymptote is (and vice versa). 
On the other hand, the ergodic sum-rate in (|45T ) is not monotonic in ^. Nevertheless, 
it can be seen that as a function of — , it has local maxima as — — )■ and — — )■ oo, 
and a minimum at — = 1. The more well-conditioned So is, the more closer — is to 
1 and hence, the high-SNR statistical beamforming sum-rate asymptote is minimized. If 
S2 is ill-conditioned, the value taken by ^ depends on the angle between the dominant 
eigenvectors of S2 and S. If the two eigenvectors are nearly parallel, ^ is close to zero 
and if they are nearly perpendicular, ^ is very large. In either case, the high-SNR statistical 
beamforming sum-rate asymptote is locally maximized. 

The conclusion from the above analysis is that among all possible channels, the ergodic 
sum-rate is maximized (or minimized) when Si and S2 are both ill- (or well-)conditioned. 
In other words, if both the users encounter poor scattering (that leads to an ill-conditioning 
of their respective covariance matrices), their fading is spatially localized. The transmitter 
can simultaneously excite these spatial localizations without causing a proportional increase 
in the interference level of the other user thus resulting in a higher ergodic sum-rate. On 
the other hand, rich scattering implies that fading is spatially isotropic for both the users. 
Any spatially localized excitation for one user will cause an isotropic interference level at 
the other user thus resulting in a smaller ergodic sum-rate. 
4) A special case that is of considerable interest is when Si and S2 have the same set of 
orthonormal eigenvectors. This would be a suitable model for certain indoor scenarios 
where the antenna separation for the two users is the same [28]. Denoting (for simplicity) 
the set of common eigenvectors by ui and U2, we can decompose Si and S2 as 

51 = [ui,U2] diag([Ai, A2]) [ui,U2]^, (62) 

52 = [ui,U2] diag([/ii, /i2]) [ui,U2]^. (63) 
We re-use the notations xi X2 to denote 

Xi = — and X2 = —■ (64) 

A2 /i2 |_. 

Without loss in generality, we can assume that Xi > 1. Two scenarioo arise depending 
on the relationship between xi and X2: i) Xi > X2, and ii) Xi < X2- 



"These possibilities arise because even though the set of eigenvectors of Si and S2 are the same, there is no specific reason 
to expect the dominant eigenvector of Si to also be a dominant eigenvector of S2. Observe that the first case subsumes the 
setting where /ii = /i2 = /i and S2 — /il. 
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Theorem 3: In the high-SNR extreme, the ergodic sum-rate is maximized by the following 
choice of beamforming vectors: 

wi,opt = e^'^iui, W2,opt = e^'^^ua if Xi > X2, 

(65) 

wi,opt = e^'^^u^, W2,opt = e^'^iui if Xi < X2 
for some choice of z/j G [0, 2tc), i = 1,2. The optimal ergodic sum-rate satisfies 



Proof: While Theorem|2]can be particularized to this special case easily, we pursue an 
alternate proof technique in Appendix |E] that exploits the comparative relationship between 



A comparison of the proof techniques of Theorems |2]and |3]is presented in Appendix IB 

5) Some remarks on the optimization set-up of this paper are necessary. The proofs of Theo- 
rems [2] and |3]require us to consider a six-dimensional optimization over the parameter space 
of {|a|, \'y\,9a,0i3,9.y,9s}. As a result, any geometric interpretation of the optimization is 
impossible. A naive approach to the six-dimensional optimization problems in this paper 
is to adopt the method of matrix differentiation calculus. For this approach to work, we 
need to show that both the set over which optimization is done as well as the optimized 
function are convex, neither of which is true in our case. Specifically, neither Q{2,1) nor 
St(2, 1) are convex sets. It also turns out that the ergodic sum-rate is neither convex nor 
concave0 over the set of beamforming vectors, even over a locally convex domain or an 
extended convex domain (like the interior of the sphere). 

6) The approach adopted in Appendices iDl and lEl overcomes these difficulties, and it consists 
of two steps. In the first step, we produce an upper bound to the ergodic sum-rate that is 
independent of the optimization parameters. In the second step, we show that this upper 
bound can be realized by a specific choice of beamforming vectors thereby confirming 
that choice's optimality. This approach seems to be the most natural (and first principles- 
based) recourse to solving the non-convex optimization problem at hand. An alternate 
approach to optimize the ergodic sum-rate is non-linear optimization theory [48]. But this 
approach is fraught with complicated Hessian calculations and technical difficulties such 
as distinguishing between local and global extrema. 

'"It is possible that some function of the ergodic sum-rate may be convex. But we are not aware of any likely candidate that 
could work. 




(66) 



Ti and T2 (which is possible in the special case) and the fact that = 0. 
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C. Intermediate-SHR Regime: Candidate Beamforming Vectors 

While physical intuition on the structure of the optimal ergodic sum-rate maximizing beam- 
forming vectors has been obtained in the low- and the high-SNR extremes, the essentially 
intractable nature of the Exponential integral in the ergodic rate expressions of Theorem [T] means 
that such a possibility at an arbitrary SNR is difficult. Nevertheless, the single-user set-up [49], 
[50] suggests that the optimal beamforming vectors (that determine the modes that are excited) 
and the power allocation across these modes can be continuously parameterized by a function 
of the SNR. Motivated by the single-user case, a desirable quality for a "good" beamforming 
vector structure ({wj^cand(p), ^ = 1, 2}) at an arbitrary SNR of p is that the limiting behavior of 
such a structure in the low- and the high-SNR extremes should be the solutions of Prop. |3]and 
Theorem |2l That is, 

limw,,,3nd(p) = e^'^'ui(S,), (67) 
lim w,,cand(p) = e^'^'^C^, ^ = 1,2, (68) 

11^2 

where the above limits are seen as manifold operations [42] on Q{2, 1). 

A natural candidate that meets (|67T ) and (1681) is the following choice parameterized by a{p) 
and (5{p) satisfying {a{p), /3(p)} G [0, C)o) and z/j G [0,27r), z = 1, 2: 

wi,cand(p) = e^'^^-Dom.eig. ((a(p)S2 + l)"'Si) (69) 

W2,cand(p) = e^'^^ • Dom. eig. ( (/3 (p) Si + l) S2) (70) 

where the notation Dom.eig(«) stands for the unit-norm dominant eigenvector operation. These 
vectors can be seen to be solutions to the following optimization problems: 



where 



arg max SINRj = Wi,cand(p), « = 1, 2 (71) 

Wi : wi^Wi=l 

SINRi = -jr ' , ' ' ^ (72) 

Wf Wi + Oi[p) Wf 2J2W1 

SINR2 = ^ ^"I:"^"h^ ■ (73) 

Vi^2 W2 + P (p) ^2 ^1 W2 

The choice in (l69l)-(l70l) is a low-dimensional mapping from Q{2, 1) x Q{2, 1) to M+ x M+ thus 
considerably simplifying the search space for candidate beamforming vectors. It must be noted 
that while the search space is simplified, the generalized eigenvector operation is a non-linear 
mapping [46] in a{p) and /3(p). 
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D. Numerical Studies 



We now study the ergodic sum-rate performance with wi and W2 as in (|69l) -(f70l) via two 
numerical examples. In the first study, we consider a system (note that Tr(Si) = Tr(S2) = 
M = 2) with 



Si 



1.7745 
-0.5178 - 0.0247i 



-0.5178 + 0.02472 
0.2255 



1.2522 -0.8739 - 0.2711Z 

-0.8739 + 0.2711i 0.7478 



(74) 



(75) 



Fig. [2ta) shows the ergodic sum-rate as a function of p for four schemes. For the first scheme, 
for every p, an optimal choice {a*(p), /3*(p)} is obtained from the search space a{p) x /3(p) G 
[0, oo) X [0, oo) as follows: 



arg max E [Ri] + E [R2] 

S.t. Wi = Wi,cand(p), W2 = W2,cand(p)- 



(76) 



The performance of the beamforming vectors with Q*{p) and /3*(p) for every p is plotted 
along with the performance of the candidate obtained via a numerical (Monte Carlo) search 
over Q{2, 1) x Q{2, 1). As motivated in the prior discussion, while we expect the performance 
with {a*(p), /3*(p)} to be good, it is surprising that this choice is indeed optimal. Further, the 
performance of a set of beamforming vectors with a{p) = (3{p) = and a{p) = 100, /3(p) = 15 
(fixed for all p in (|69l)- (r70l) ) are also plotted. Observe that these two sets approximate the low- 
and the high-SNR solutions of Prop. |3]and Theorem |2l respectively. 

In the second study, we consider a system (again, note that Tr(Si) = Tr(S2) = M = 2) with 



Si 



1.3042 0.0543 - 0.2540z 

0.0543 + 0.2540Z 0.6958 



1.1161 

-0.2195 - 0.4340i 



-0.2195 + 0.4340Z 
0.8839 



(77) 



(78) 



Fig. [2tb) plots the performance of the proposed scheme, the low- and the high-SNR solutions in 
addition to the candidate obtained via a numerical search over ^(2, 1) x ^(2, 1). As before, the 
low- and the high-SNR solutions are optimal in their respective extremes while the candidate 
{a* (p) , l3* (p)} is optimal across all p. 

Note that in Fig. [2ta) there exists an SNR-regime where both the low- and the high-SNR 
solutions are sub-optimal. In contrast, in Fig. l2lb), the high-SNR solution essentially coincides 
with the numerical search for all p whereas at the low-SNR extreme, the performance of the 
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low-SNR solution is as expected. We now explain why the high-SNR solution performs as well 
as {a* (p) , (3* (p)} for all p. For this, we need to understand the behavior of the angle between 
the proposed set of beamforming vectors in (|69])-(|70T| and the low-SNR solution as a function of 
a and /3. In Fig. Sb), we plot cos {Ang\ei{a{p))) as a function of a{p) and cos (Angle2(/3(p))) 
as a function of /3{p) where 



cos (Anglei(a(p))) 
cos (Angle2(/3(p))) 



(^Dom.eig. (^(a(p)S2 + l) ^ Si))^ Dom. eig. (S 



H 



(79) 
(80) 



^Dom.eig. (^(/3(p)Si + l) 'Sajj" Dom.eig.(S2; 

From Fig. Hb), we note that the chordal distance^ between the low- and the high-SNR solutions 
is small (on the order of 0.05). Also, observe that there is a quick convergence of (|69])- (|70|) 
as a (or f3) increases to the high-SNR solution and hence the high-SNR solution is a good 
approximation to the choice {a* (p) , (3* (p)} over a large SNR range. On the other hand, from 
Fig. Ha), we see that the chordal distance between the low- and the high-SNR solutions is large 
(on the order of 0.90), which translates to the sub-optimality gap in Fig. HI a). 

While we are unable to prove the optimality structure of the proposed scheme in the intermediate- 
SNR regime, motivated by our numerical studies, we make the following conjecture. 

Conjecture 1: In the intermediate-SNR regime, the ergodic sum-rate is maximized by the 



'^^In short, the chordal distance is the square-root of the difference of 1 and the square of the quantity computed in ( I79t 
(or ( I80t ). See ( I136t for more details. 
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(b) 



Fig. 3. Anglei and Angle2, defined in (I79b- i80b . as a function of a and /3 for the setting in: (a) ( I74b -( l75t . (b) ( I77t -(|78 



following choice of beamforming vectors: 

wi,opt = e^^^i-Dom.eig. ((a*(p)S2 + l)"'Si) (81) 

W2,opt = e^'^^-Dom.eig. ((r(p)Si + l)"'S2) (82) 

for some choice of Vi E [0,27r), i = 1,2. The notation Dom.eig(«) stands for the unit-norm 
dominant eigenvector operation and 



{a*{p), f3*{p)} = arg max E [Ri] + E [R^] 

{"{P),/3{P)} 



S.t. Wi = Wi,cand(p), W2 = W2,cand(p)- 



(83) 



V. Ergodic Sum-Rate: Generalizations 

We studied the structure of ergodic sum-rate maximizing beamforming vectors in Sec. |IV] In 
this section, we consider more general problems of this nature. 



A. Maximizing E [Ri] 

Consider a system where the Quality-of-Service metric of one user significantly dominates that 
of the other user. For example, one user is considerably more important to the network operator 
than the other. The relevant metric to optimize in this scenario is not the ergodic sum-rate, but 
the rate achievable by the more important user. In this setting, we have the following result. 
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Proposition 4: The optimal choice of the pair (wi opt, W2_opt) that maximizes E [Ri] is: 

i) Low -SHR Extreme: 

Wj,opt = e^''' ui (Sj) and w^- opt = any vector on ^(2, 1), j ^ i (84) 

ii) High-SHR Extreme: 

Wi,opt = e^''' ui (Si) and w^- opt = e-^'"' ^2 (S^) , j (85) 

for some choice of z/j G [0, 2n), i = 1,2. 

Proof: See Appendix [Gl ■ 
With the above choice of beamforming vectors, E [Rj] can be written as 

Em ^iMM (86) 

Xi - 1 

where Xi = J^^^- From (l86l) . it is to be noted that E [Ri] increases as Xi increases. That is, 
the more ill-conditioned Sj is, the larger the high-SNR statistical beamforming rate asymptote 
is (and vice versa). This should be intuitive as our goal is only to maximize E [Ri] and the 
beamforming vectors in (|85T ) achieve that goal. 



B. Weighted Ergodic Sum-Rate Maximization 

In a system where the Quality-of-Service metrics of the two users are comparable (but not 
the same), the relevant metric to optimize is the weighted-sum of ergodic rates achievable by 
the two users [40]. Specifically, the objective function here is 

"^weighted = ClE [Ri] + C2E [R2] (87) 

for some choice of weights Ci and ^2 satisfying (without loss in generality) {^1, ^2} ^ [0, 1]. 
Note that E is a special case of this objective function with ^1 = 1, ^2 = or ^1 = 0, C2 = 1- 
Maximizing T^-^eighted to obtain a closed-form characterization of the optimal beamforming 
vectors seems hard in general. Motivated by the study for the sum-rate in the intermediate-SNR 
regime in Sec. HVl we now consider a set of candidate beamforming vectors that produce known 
optimal structures in special cases. For this, it is important to note that no choice of a{p) and 
/9(p) in (I69l) -(r70l) can produce the beamforming vectors in (|85T ). A candidate set of beamforming 
vectors that not only produces the special (extreme) cases in the ergodic sum-rate setting, but 
also (l85l) is the following choice parameterized by four quantities, {a;(p), /3(p), 7(p), 5(p)} G 
[0,00): 

wi, weighted, cand (p) = e^""' " Dom.eig. ( (« (p) S2 + l) (7(p)Si +l)) (88) 
W2, weighted, cand (p) = c^'^^ • Dom . eig. ( (/3 (p) Si + l) (5(p) S2 + l) ) (89) 
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where the notation Dom.eig(«) stands for the usual unit-norm dominant eigenvector operation. 
As before, (|88T)-(l89l) corresponds to a low-dimensional map from {g{2,l)}^ to {[0,oo)}^ and 
thus a simplification in the search for a good beamformer structure. 




p (in dB) 



Fig. 4. Weighted ergodic sum-rate of proposed sclieme witli Si and S2 as in ll74t-(l75t. 

We now study the performance of the proposed beamforming vectors in (|88l) - (|89] ) for the 
system with Si and S2 as in (I74l)-(r75l). Two sets of weights are considered: i) ^1 = 1, ^2 = 0.5 
and ii) Ci = 0.2, (^2 = 0.8. Fig. |4] plots the performance of two schemes. The first scheme 
corresponds to a Monte Carlo search over Q{2, 1), whereas the second scheme corresponds to 
the use of an optimal choice {a*(p), /3*(p), 7*(p), (5*(p)} (for every p) with 

{a^(p), r(p), y(p), 5'ip)} 

= arg max E [Ri] + E [R2] 

{q(p),/3(p),7(p),'5(p)} 

S.t. Wi = Wi weighted, cand(p)) W2 = W2, weighted, cand (p) • (90) 

As can be seen from Fig. HI the proposed scheme in (|88T )-(|89]) performs as well as the Monte 
Carlo search for both sets of weights. Numerical studies suggest that similar performance is seen 
across all possible Si and S2, and all possible weights Ci and ^2- Motivated by these studies, 
we pose the following conjecture. 

Conjecture 2: In the intermediate-SNR regime, the weighted ergodic sum-rate, 7?.weighted, is 
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maximized by the following choice of beamforming vectors: 

wi = e^"' ■ Dom.eig. ((a*(p)S2 + (7*(p)Si +l)) (91) 

W2 = e^'^^-Dom.eig. ((/3^(p)Si + l)"'((5^(p)S2 + l)) (92) 
for some choice of Vi E [0, 27r), i = 1, 2 and where {a*{p), /3*(p), 7*(p), 5*{p)} are as in (|90l) . 



C. Rank-Deficient Case 

Following up on Remark 3 in Sec. |IV-B[ we now consider the extreme case where both0 Si 
and S2 are rank-deficient in more detail. 

Proposition 5: The ergodic information-theoretic rate achievable at user i is 

Em = /^(fAi(S,) (|ui(l]0''w,r + |ui(S,)%r))-/i(fAi(S,)|ui(S,)%r), 

jV^, ^ = 1,2 (93) 

where h(») is as in (|2T]) and the eigen-decomposition of Sj is 

= Ai(S,)-ui(S,)ui(S,)'', ^ = 1,2. (94) 

Proof: While Theorem [T] (as stated) is explicitly dependent on both Si and S2 being of full 
rank and is hence not directly applicable in this extreme setting, much of the analysis follows 
through. The key to the proof is that all the results in Appendix |A] (Lemmas [T] and ^ also hold 
when some of the diagonal entries of Aj are zero. In fact, this fact is implicitly used to compute 
E [/i,2] in Theorem \T\ U 

D. Three-User Case: M = 3 

We now consider the task of generalizing Theorem [1] to the three-user (M = 3) case. 
Proposition 6: The ergodic information-theoretic rate achievable at user i (where i = 1,2,3) 
with linear beamforming in the three-user case is 

E[Ri] = E[h,,]-E[Ii,2] 



Af,2-M^) ^ Ks-h' '''''' 



( Ai, 1 - A,, 2) ( Ai, 1 - Ai, 3) ( Ai, 1 - A,, 2) ( Ai, 2 - A^, 3) ( A^, 1 - A^, 3) (A,, 2 - A^,, 3) 
Al,.h(^) Al,.h(e^) K,-h(^^^^ 



(^Ai, 1 - Ai, 2) (a,, 1 - Ai, 3) {hi^ 1 - Ai, 2) (a,, 2 - Ai, 3) {Ki^ 1 - A^, 3) 2 - A^, 3) 

(95) 



'''The case when only one of the S; is rank-deficient can be studied along analogous lines and no details are provided. 
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where h{») is as in (|2TI) . The eigenvalue matrices Aj = diag([Aj i, Aj 2, -^1,3]) and Aj = 
diag([Aj 1, Aj 2, -^1,3]) arc defined as in (|7]) and (fTOl ). and can be obtained in terms of the 
beamforming vectors and the co variance matrices by solving the associated cubic equations. 

Proof: The proof is tedious, but follows along the lines of Theorem [T] The first step is in 
characterizing Pi{y), which is done in Lemma [2] of Appendix lAl We can then generalize (l28l) 
using (|12H) as 

^^^''^^ (Ai,i - Ai,2) (A,,i - Ai,3) ^ (A,,i - A,,3) (Ai,2 - Ai,3) ^^^^ 



1^ = / x^e-^ y log [1 + ^ Ai^ ix - !^xy)dydx (97) 

Jx=0 Jy=0 V O 6 / 

12=/ X V" /" ' ' y\og(l + ^Ai^3X+^xy)dydx. (98) 

Jx=0 Jy=0 V O 6 / 

These integrals are cumbersome, but straightforward to compute using [45, 4.337(2), 4.337(5), 
p. 572]. The result is the expression in the statement of the proposition. ■ 



E. General M-User Case 

As can be seen from Appendix lAl (Lemma |2)). the expression for Pi{y) becomes more compli- 
cated as M increases. Without a recourse to Pi{y), closed- form expressions for the ergodic sum- 
rate of the linear beamforming scheme can be obtained using a recent advance in [31], [32] that 
allows the computation of the density function of weighted-sum of standard central chi-squared 
terms (generalized chi-squared random variables). For example, if A/ = diag([Aj 1, ■ ■ ■ , Aj^ m]) 
and Aij, j = 1, - ■ ■ , M are distinc10, we have 

MM . / A \ 

k=i j=i,jj^k ^'-^ ^ ^ 

For [Jj 2], we replace {Aj_fc} with {Ajfc}. It can be checked that these expressions match 
with the expressions in this paper for the M = 2 and M = 3 settings. Nevertheless, it is 
important to note that the formulas in (|95T ) and (|99] ) are in terms of the eigenvalue matrices 
{Ai,Ai,i = 1, ■ ■ ■ , M}, which become harder (and impossible for M > 5) to compute in 
closed-form as a function of the beamforming vectors and the transmit covariance matrices as M 
increases. Tractable approximations to the ergodic sum-rate and beamforming vector optimization 
based on such approximations are necessary, which is the subject of ongoing work. 

"More complicated expressions can be obtained in case {Aij} are not distinct. These expressions can be derived in a 
straightforward manner using the resuhs in [31]. 
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VI. Concluding Remarks 

This paper considered the design of statistical beamforming vectors in a MISO broadcast 
setting to maximize the ergodic sum-rate. The approach pursued here for the simplest non-trivial 
problem with two-users is as follows: first, the beamforming vectors are fixed and ergodic rate 
expressions are computed in closed-form in terms of the covariance matrices of the links and 
the beamforming vectors. The optimization of this non-convex function results in a general- 
ized eigenvector structure for the optimal beamforming vectors, the solution to maximizing an 
appropriately-defined SINR metric for each user. This structure generalizes the single-user setup 
where the dominant eigen-modes of the transmit covariance matrix of the links are excited. The 
main results of this paper are presented in Table I for different SNR (p) assumptions where we 
use ui(«) and U2(») to denote the dominant and sub-dominant eigenvectors of the matrix under 
consideration. 

Possible extensions of this work include unifying the special case of Theorem |3] with the 
general case of Theorem |2l and proving Conjectures [T] and |2] Developing intuition in the three- 
user case as well as tractable approximations in the general M-user (M > 2) case critically 
depend on exploiting the functional structure of the ergodic sum-rate expression. The generalized 
eigenvector solution has been seen in other multi-user scenarios as well, for example, the 
interference channel problem with two antennas [39]. Generalizing the theme developed in the 
broadcast setting to the interference channel setting, the Rayleigh case to the Ricean case, and 
the perfect CSI case to the case where only statistical information is available are all important 
tasks. 



Table I: Structure of Optimal Beamforming Vectors 


Objective Function: 


arg max E [Ri\ , z = 1, 2 

Wl, W2 


arg maxE + E [R2] 

Wl , W2 


p -> 


Wi,opt = Ui (Si) 

Wj,opt = any vector on Q{2, 1), 
j ^ i (See Prop. S]) 


Wi,opt = Ui (Si) 
W2,opt = Ul (S2) 

(See Prop. [3]) 


p intermediate 




wi,opt = ui ((a*(p)S2 + I)"'Si) 

W2,opt = Ui ((r(p)Si+I)-^S2) 

{a*{p),P*{p)} > 0, chosen 
appropriately (See Conjecture [D 


p — )■ cxo 


Wi,opt = Ui (Si) 
Wj- opt = U2 (Si) , 

j ^ i (See Prop. S]) 


Wi,opt = Ul (S2"^Si) 
W2,opt = Ul (Sf^S2) 

(See Theorems |2] and |3]) 
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Appendix 

A. Density Function of Weighted-Norm of Isotropically Distributed Unit-Norm Vectors 

Towards computing i], we generalize the technique expounded in [51] where the surface 
area (that is required) to be computed is treated as a differential element of a corresponding solid 
volume (at a specific radius value), and the volume of the necessary solid object is calculated 
using tools from higher-dimensional integration (geometry). In this direction, we have 

Pi{x) dx = P (hf^^^iAih-M,i e[x,x + dx]^ (100) 

- ^P(h^'d,.A,h;;d,.<x) (101) 



Pi{x) 



with 



where 



P(hf,_.A,hiid,i<a: 



Area (,r. 1) 
Area (1) 



(102) 



Area {x, y) = Area \ h.^^ ^Ki\i-M,i > x, ||hiid,i||^ = yj and (103) 

Area (y)^ Area (llhiid,, II' = y) (104) 

denote the area of a (unit radius) spherical cap carved out by the ellipsoid 

{hiid.i :hifd,,Aihiid,i = x} (105) 

and the area of a (unit radius) complex sphere, respectively. The volume of the objects desired 
in the computation of Pi{x) are 

Vol(x,r2) 4 Vol(h?,^,A,h„d,,>x, ||hiid,,||'<r2) 
Area {x, y) dy, 



y=o 



Vol 



(||hiid,i||' < r') = / Area(a;)cia;. 

^ ^ Jx=0 



(106) 
(107) 

(108) 



Thus, we have 



Area {x, 1) 
Area (x) 

Pi{x) 



_d_ 

Qj.2 

_d_ 



Vol (x, r') 
Volfr' 



r=l 



r=l 

and hence, 



g.Vol (:r. r^) 



dxr 



r=l 



J. Vol (r2) 



(109) 
(110) 

(111) 



r=l 
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It is important to realize that computing Vol (x,r^) is non-trivial even in the simplest case 
of M = 2. This is because every additional dimension to the complex ellipsoid corresponds to 
addition of two real dimensions, thus rendering a geometric visualization impossible. For exam- 
ple, with M = 2, we have the intersection of two four-dimensional real objects. Nevertheless, 
the following lemma captures the complete structure of Pi{x) when M = 2. 

Lemma 1: If M = 2, the random variable h^^ ■ Aj hijd, j is uniformly distributed in the interval 

Proof: First, note that it follows from [51, Lemma 2] that 

M 2M 

Vol(r2) = ^^. (112) 

For computing Vol (x, r^), we follow the same variable transformation as in [51]. We set hiid,i(fc) = 
rfcexp(j0fc) for k = 1,2. The ellipsoid is contained completely in the sphere of radius r if r is 
such that r > ^/ir—, whereas the sphere is contained completely in the ellipsoid if r < ^/ir- 



In the intermediate regime for r, a non-trivial intersection between the two objects is observed 
and one can compute the volume by performing a two-dimensional integration as follows: 

Vol(x,r^) = J J rir2(f)i(f)2dridr2d(f)id(f)2 (113) 
= (27r)'- / [ ridrir2dr2 (114) 



where 



J JB 

(27r)'- / r2dr2 / ndn (115) 
Jo Jl 



A = |ri,r2 : r^Ai^i + rlAi^2 > x, + < r'^^ 

and |0i,02 : [0,27r)}, (116) 
|ri,r2 : r^A^,! + r^Ai,2 > + < r^j, (117) 



B 



— ^, U = Jr^-rl r =- — — . (118) 

Ail V Aii-Ai9 



Trivial computation establishes the following: 



0' ^^^/aTT 



Vol [x,r^) = < 



^2 (r^ Ai^-xV 



2 A,,i-(A,,i-A,,2)' \/A,,i-' -\/A,,2 



<r<J-^ (119) 



III . ^ r> 

2 y A,,iA,,2; ' ' - V A,,2- 



Another trivial computation using (|1 111 ) results in 



P.(a;) = T • (120) 
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That is, ■ Ajh|id,i is uniformly distributed in its range. ■ 
The structure of Pi{x) gets more complicated as M increases. We now provide its structure in 
the M = 3 and M = 4 cases without proof. 

Lemma 2: With M = 3, the density function Pi{x) is of the form: 



Pi{x) 



0, 



(A,:, i-Ai, 3) (Ai, 2-A.i, 3) ' 

2{Ai,i-x) 
(A,;, i-Ai, 2) (Ai, i-Ai, 3) ' 

0, x>A.i. 



Ajo < a; < Aj 1 



With M = 4, the density function Pi{x) takes the form: 



0, 

3(x-Ai,4)^ 



Pi{x) 



(Ai, i-Ai, 4) (Ai, 2-Ai, 4) (Ai, 3-Ai, 4) ' 



(Ai, i-Ai^3) (Ai, 2-Ai,4) 



3(A,,i-x)^ 



(Ai, i-Ai, 2) (Ai, i-Ai,3) (Ai, i-Ai, 4) ' 

0, 



X < Aj_4 

Ai,4 <X< Aj^3 

Ai,3 < a; < Aj^2 
a; > Ai, 1 



where 



{x - Ai^a) (Ai,2 ^{x- Ai,4) (Ai,i 



Ai, 2 - A. 



i,3 



Aj, 1 - A, 



i,4 



(121) 



(122) 



(123) 




Fig. 5. CDF of weighted-norm of isotropically distributed beamforming vectors. 
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Fig. [5] illustrates the trends of the cumulative distribution function (CDF) by plotting the fit 
between the theoretical expressions in Lemmas [T] and |2l and the CDF estimated by Monte Carlo 
methods. The cases considered are: a) Aj = diag([2 1]) for M = 2, b) Aj = diag([3 2 1]) for 
M = 3, and c) Aj = diag([4 3 2 1]) for M = 4. The figure shows the excellent match between 
theory and Monte Carlo estimates. 



B. Rewriting the Rate Expression in the High-SNR Extreme 

A straightforward exercise shows that (|36^ can be rewritten as in (11241) below: 

E[R,] n~ l^(rf^^(wi,W2))+log(^l + |j^ -log(2) (124) 

where g{») is a function defined as 

g{z) ^ /(^) + 21og(z), (125) 



1 , fl + Vl 



z 



In (11241) . c?Si (wi, W2) is defined as 



with Ai, Bi and Ci as in Theorem [T] As illustrated in Fig. |6l /(•) is monotonically decreasing 
as a function of its argument and g{») is increasing with 

21og(2) = \img(z) < giz) < \\mgiz) = 2 (128) 

z^O 2— >-l 

oo = lim/(z)> f{z) >\imf{z) = 2. (129) 

Formal proofs of these facts are provided in Appendix O next. Some properties of dsX^i, ^2) 
are now established. 

• The quantity d-^^ (wi, W2) is a generalized "distance" semi-metric0 between wi and W2 
satisfying 

0<c?s, (wi,W2) <1. (130) 

To establish the lower bound in (11301 ). note that an application of the Cauchy-Schwarz 
inequality implies that Cf < AiBi. Equality in the lower bound in (11301 ) is achieved if and 

1/2 1/2 

only if S/ wi = C^/ ^2 for some C ^ C. Since Sj is positive-definite, this is possible 
only when wi = Cw2. Since both wi and W2 are unit-norm, this is possible only with 
ICI = 1. In other words, equality in the lower bound only occurs for wi = W2 on ^(2, 1). 

""A semi-metric satisfies all the properties necessary for a distance metric except the triangle inequality. 
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f(x) 

■ - g(x) 



o 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 

X 



Fig. 6. The behavior of f{x) and g{x). 



The fact that d-s^ (wi, W2) < 1 is obvious. Symmetry of the distance metric in wi and W2 
is obvious. 

The triangle inequality does not hold in general. One counter-example is as follows: 







S 


i = diag([ 


20, 


1]), Wi = 








" 1 -1" 














.71' 71. 




W3 = 




This choice results 


in 












c^s, (wi 


, W3) 




0.2536, 




(Wi,Vi^2 


) + 



1 

7!' 





0.2534. 



(131) 
(132) 

(133) 



Many such counter-examples can be listed out via a routine numerical search. Numerical 
studies also suggest that the triangle inequality holds for almost all choices of {wj} provided 
that Xi = 'xjj^l too large (unlike the example in (I131I )- (I132I) ). 

The upper bound in (|130l) is achieved only if Ai = Bi and = 0. By decomposing wi 
and W2 along the orthonormal set of basis vectors {ui(Sj), U2(Sj)}, it can be checked that 
Ai = Bi and Cj = is possible only if 



Wi 



W2 



Ui(S,) 



Xi 



Xi 



J (1^1+1^3) 



Ui(Si 



-e^-^-U2(S,) 



Xt 



Xi 



(134) 
(135) 



for some choice of Uj E [0, 27r), j = 1, 2, 3. 
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The semi-metric reduces to the standard chordal distance metric [42] on Q{2, 1) 

dY,, (Wi, W2) = 

if S, = AI for some A > 0. 



1 — |wf^W2 



(136) 



C. Monotonicity of f{») and g{ 
We first claim that 

1 



2 < 



log 



1 - vT 



< — , < ^ < 1, 



(137) 



which is equivalent to: 

exp (2 ■ Vl - \ < 



1 + VT 



1 - ^ 



< 



exp 



2VT 



< ;z < 1. 



(138) 



For this, we start with the exponential series expansion of exp (2 ■ \/l — z'^) that results in: 



exp 



2 ■ Vl 



k=l 

CO 



T{k + 1) 



< 1 + 25^(1-.2)^ = 1 + - 

k=i 



2VT^ 



(139) 
(140) 



where r(-) is the Gamma function, the second inequality follows from the fact that < 1 

for all A; > 1, and the last equality from the sum of an infinite geometric series. For the other 
side of (|137l) . note that 



exp 



2VT 



> 1 + 



> 1 



2VT 



2(1 -z' 



2va^ 



2VT 



1 - VT 



(141) 



(142) 



where the first inequality follows by truncating the terms of the asymptotic expansion and the 
second follows by using the fact that z'^ < 1. The proof is complete by noting that 

df{z) -z 2 1 



dz 

dgjz) 
dz 



—z 
z 



log 



1 + 



log 



i-VT 



< 
> 0. 



(143) 
(144) 
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D. Completing the Proof of Theorem |2] 

With the description of wi and W2 as in (|48l) -(l49l). elementary algebra shows that 

X 

|7p^i + \^?V2 



1 1 



A, = wfSiWi = ^i^^^^, X=||aS2-^vi + /3S-^V2|| (145) 



Bi = wfSiW2 = ^-^Y^^^, r = ||7S2^vi + 5S2^V2|| (146) 

Ci = |wi S1W2I = (147) 

A2 = wfS2W2 = -^ (148) 
B2 = wfS2Wi = ^ (149) 

C2 = |wfS2W2| = J ' ■ (150) 

As in the statement of the theorem, let Tj, i = 1, 2, 3 denote 

ri=vfS^ivi, r2 = vfS2^V2, r3 = vfS^iv2. (151) 

We can rewrite and in terms of {rj} as 

= |«|Vi + |/3|V2 + 2|«||/3||r3|cos(0i) (152) 

= |7|Vi + |5|V2 + 2|7||5||r3|cos(^2) (153) 

where 9i = arg(r3) + 9j3 — 9a and 6*2 = arg(r3) + 9s — 9^. Now note that if {vi, V2} is a pair 
of eigenvectors for S, then so is the pair {e^'^^ vi, e-''^^ V2} for any choice of ui and 1^2. In other 
words, the choice of {vi, V2} is unique only on Q(2,l), and not on St(2,l). Hence, arg(r3) 
can be chosen arbitrarily and independently in determining the values of and Y"^. With the 
specific choice that arg(vf S2 V2) = \ + 9^- 9f} in (I152D and arg(vf Eg V2) = f + 6*^ - 6*5 
in (11531) . we have 

X^ = |a|Vi + |/3|V2 (154) 

Thus, the high-SNR expression for the ergodic sum-rate can be simplified as 

2E [Ri]+2E [i?2] +41og(2) 

\(\a\'vi + WV2)-Y^+(\-l\''r]i + \5\^r]2)-X^] °V 
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Using (|126l) to rewrite the above equation in terms of /(•), we have after simplification: 

2v^Xr-|/37-a5| 



2E + 2E [R2] - log(r/ir72) = / 



(2XY ■\P^-a6\\ , / |/37-a5p 



We now claim that the following two inequalities hold: 



> ^^-L± (158) 



+ - ^^+^2 



XF- (|7| 


Pr/i + \S\%) 


(l«l 


Pr/i + |/3|2772)r^ 


^+(l7l 


IV + \6\^r]2)X^ 



> (159) 

The proof of (11581 ) and (11591) will be tackled later. 

Using (|158l) and (11591 ) in conjunction with the decreasing nature of /(•), we have 

'^y/Vimnr2 ■ m \/3-f-a6\ 



2E + 2E [R2] - log(r/ir/2) < / 



+ /|'Vn^.|g,-„,|U2logf , lf-°fi; I (160) 



Ti + r2 / vItP'?! + l'^P''?2 

_ / 2y/riiri2TiT2 ■ r]2 \f3-f - a6\ 

+ « ■ 1/^7 - .^1) + 2 log f . (161) 

V^l+^2 / y 4:TiT2r]2y/'nir]2 J 

Note that {9,} enter the above optimization only via the term \(3'j — a5\ and 

I/37 - a6\ < \a\^/l - |7|2 + I7I y^l - |aP (162) 

with equality achieved if and only if 9a + 9s — 9(3 — = tt (modulo 2n). Parameterizing |a| and 

I7I as |a| = sin(6') and I7I = sin(0) for some {9,(f)} G [0,7r/2], we have 

I/37 - a5\ < sm{9) cos(</)) + cos{9) sin((/)) = sin(^ + </>)< 1 (163) 
since < ^ + < tt. Further, r]i > 7]2 implies that |7p?7i + |5p^72 > and hence, we have 

l''^-"^! <1. (164) 



|7|2?7i + |5|2?72 ri2 

Using the fact that g{») is an increasing function, we get an upper bound for the sum-rate as 
2E \R,] + 2E \R,] < f ( + ; ( h/^) + ('A) ^ „«) 
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This upper bound is achievable with the choice of |a| = 1 and I7I = in (1481) and (1491 ). which 
is equivalent to (l44l) . Substituting this choice in the sum-rate expression yields 



Em^ElB,] l/(V!fn5)4/(?v^)a,og(*) (166) 

2 V ^71^-2 + ^2n / 2 \Ti + T2j 2 ^72/ 



2 
1 

2 ' 
+ - 



?7i'r2 + r]2Ti 



n + 1"2 



■ log 



Hi 

Vir2 + r]2Ti + \r]iT2 - r]2Ti\ 
V1T2 + mn - \r]iT2 - 772X1 1 



m - T2\ 

To simplify (11671) . we define ki and K2 as 

'ni'T2 



n - 


hT2 + 




- 7-2I 


n - 


VT2 - 




- T-2I 



■r]2T\ 



■ log 



and K2 = — . 



log 



(167) 



(168) 



The fact that rji > 772 implies that ni > K2. Thus, there are three possibilities: i) ni > K2 > 1, 
ii) Ki > 1 > K2, and iii) 1 > /ti > /t2. It is straightforward but tedious to check that in all of the 
three cases, (11671 ) reduces to (l45l ) as in the statement of the theorem. The proof will be complete 
if the inequalities (11581) and (11591) can be established. 
Proof of i \158i : For the first inequality, note that 

x2 + y2 X Y 1 ^ 
= — + — = t + - = 

XY Y X t 
can be written as a symmetric function q{t) in t where t 



(169) 



Y ■ 



^ Further, noting that q(t) is 
decreasing in t for t < 1 and is increasing in t for t > 1, the maximum of 



is achieved 



either when y achieves its largest or smallest value. The inequality in (|158l) follows since 




maxl^Ti, T2) 

Proof of i \159i : The proof of (11591 ) is more involved. For this, note that 

(|a| V + l/3|'^2)r2 + (|7| V + |5pr/2)X2 




Xr- (|7|2r/i + |5|2ry2) 

"1^(^71 - ^72) + ^72 



-T2) + T2 



Tl - T2) + T2 



1 + 



- T2) + T2 



|7P('7i - ^2) + ^72 |ap(n-^2) + ^2 



(170) 



(171) 



(172) 



By taking derivative with respect to note that the first term in (|1721 ) is increasing in jap for 
any fixed choice of I7I if and only if ^ > 1. Similarly, for any fixed choice of I7I, the second 
term in (|1721) is increasing in |ap if and only if ^ > ^. Thus, the condition 



1 < — < — (173) 

T"2 ri2 

is necessary and sufficient to ensure that for any choice of I7I, Ci is maximized by the choice 
I a I = 1. On analogous lines, taking the derivative with respect to |7p, it can be seen that for 
any fixed choice of both the terms in (|1721 ) are decreasing in I7P if and only if the same 
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condition in (11731) holds. In other words, under (|173l ). Ci is maximized by |a| = 1 and I7I = 0. 
At this stage, two other possibilities need to be considered: i) — < 1 < 21 and ii) 1 < 21 < il . 
In either case, we will show that 

\i\^{vi-V2) + V2 V2)'x [y \It2)\ V2\lnx^ 

which is equivalent to (11591 ). or the statement that Ci is maximized by \a\ = 1 and I7I = 0. For 
this, note that in either case, we have 



< 0, (174) 



-^2) +^2 ^ m ^^^^^ 



ll^iVl -^12) +V2 V2 



In the first case, we also have 



^<^<#<^/^, (176) 

where the last step in (11761 ) follows from ^ > 1- Combining (11751 ) and (11761 ), we note that (11741) 
is immediate when ^ < 1 < 21. in the second case, however, (|176l) is replaced with 

It can be seen that if |a| and I7I are such that 

V=0-'^A (178) 

for some choice of D satisfying I < D < ^ ■ (11741) holds immediately. Thus, we only need 
to show that (11741) holds when \a\ and I7I are such that 

V=^-^#. (179) 

Y ri2\ n 

for some choice of D satisfying ^ < D < 1. After some elementary algebra, our task is to 
show that 



ItPI'?! - V2) + V2 V2 \ \ V2 n 

^ ^ D.'JlM (181) 

Y V2\ n 

By bounding the denominator of (11801) as 772 < |7p(^i ^V2) +^2 < Vi^ it can be seen that (11801) 
holds if the following quadratic inequality in D is true: 

^72^1 V m(n-1"2)/ 112 V ^27-1/ ^?2(t-1-^2) 
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For this, we note that the left-hand side represents a convex parabola in D with maximum 

1. Substituting D = ^ and D = 1 and simplifying, we see 



achieved at either D = ^ or D 
that 



LHS of (fT82ll 



LHS of (fT82D 



D=l 



-T2 



V1V2T1 Ti - T2 

iVi - V2) ■ imri - r]iT2) 
viin - r2) 



iViin - r2) + n{r], - ri2)) < (183) 



< 0. 



(184) 



Since the maximum of the parabola in the domain ^ < -D < 1 is below 0, (11741) holds. Thus 



we are done with the aspect of showing that |Q;| = l,|7| = 0is sum-rate optimal. 



E. Proof of Theorem [3 

Following the logic of Appendix iDl we decompose wi and W2 along {ui,U2} since they 
form an orthonormal basis: 

wi = aui + (5vl2 (185) 

W2 = 7U1 + 5u2 (186) 

for some choice of {a,/3,7,(5} with a = \a\e^^" (similarly, for other quantities) satisfying 
+ = I7P + |5p = 1. We now study the ergodic sum-rate optimization over the six- 
dimensional parameter space. With the description of wi and W2 as in (|185l) - (|186l ). elementary 
algebra shows that 

(187) 
(188) 
(189) 
(190) 
(191) 
(192) 



A, 


= w^SiWi = 


|«|'Ai + |/3pA2 


Bi 


= W^SiW2 = 


l7l'Ai + |5|'A2 


Ci 


= wf^SiW2 = 


= |a*7Ai + (3*5X2] 


A2 


= W^S2W2 = 


|7|^/il + |5p/i2 


B2 


= wf^5]2Wi = 


ap/ii + /3^/i2 


C2 


= wf^S2W2 = 


= |a*7/ii +/3*(5/i2| 



and hence. 



rfSi(wi, W2)^ 

(is2(wi, wa)^ 



A{A^B, - Cl 

(A, + i?i)2 

4(^2^2 - Cl 
{A2 + B2f 



4A1A2 • 1/37 - aJp 



(|a|2+|7|2)Ai + (|/3|2 + |5|2)A2 
4/ii/i2 ■ I/37 - Q;5p 



(|«P+|7|2);.l + (|/3|2+|5|2)^2 



(193) 



(194) 
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The high-SNR expression of the ergodic sum-rate can be written as 

2E [Ri]+2E [i?2] + 41og(2) 

V4A1A2 ■ I/37 - a6\ \ f A/4/ii/i2 ■ I/37 - a6\ 



|apAi + 


|/3PA2 


I7PA1 + 





+ 2 log ( 1 + , ) + 2 log ( i + r r^ '^r ) • (195) 



+ |7|2)Ai + + \S\^)X,J ^ V(I«P + l7P)/ii + {W + W)P^2. 

I7IV1 + |(5|V2 
+ |/3|2/i2^ 

Note that {^^,} enter the above optimization only via the term |/37 — a5\ and as in (11621) . we 
have 

\h-a5\<\l3\\-i\ + \a\\5\<l. (196) 

Given that Xi = ^ > 1, three possibilities arise depending on the relationship between 1, xi 

and X2 = i) Xi > 1 > X2, ii) Xi>X2> 1, and iii) X2 > Xi > 1- 

Case i): In the first case where X2 < we use the fact that g{») is an increasing function to 
bound the sum-rate as 

2E [Ri]+2E [i?2] + 41og(2) 



sin^e) + sin2(0))(Ai - A2) + 2A2 7 \2fi2 - (sin^e) + sm\(j))){fi2 - l^i 



+ 2 log ( .^•""(^+^M + 2 log f ^-y^ + ^V ) . (197) 



^sin (0)(Ai - A2) + A2/ VAi2 - sin (6')(/i2 - /i 

Observing that 

(sin2(^) + sin2(0))(Ai-A2) + 2A2 < sin2(0)(Ai - A2) + Ai + A2 (198) 

2/i2-(sin2(^) + sin2((/))) (^2-/^1) < 2/i2 - sin2(e)(^2 - /ii) (199) 
and /(•) is a decreasing function, we have 
2E [Ri] + 2E [i?2] +41og(2) 



^ I V4A1A2 ■ sin(g + 0) \ \ f f V^m^-sm{e + 



sin (0)(Ai - A2) + Ai + A2/ v2/i2 - sin {6){fi2 - A^i 



+ 2 log I -'"f ) H- 2 log ( ^,;:7"' + J (2W) 

sm (0)(Ai - A2) + Aa/ \fi2 - sm (6')(/i2 - /ii 



V4A1A2 ■ sin(6' + 0) A / V4AiiAi2 ■ sin(6' + 



sin^(0)(Ai - A2) + Ai + A2/ V2yU2 - sin^(6')(/i2 - 1^ 



+ 2 log f 1 + . 2,^v/' X A , X ) + 2 log f 1 + 7) . (201) 

V sm^(0)(Ai - A2) + A2/ V fi2 - sm\e){fi2 - 1^1) J 

It is straightforward to note that the right-hand side of (12011 ) is decreasing in sin^(0) and 

increasing in sin^(^^). If in addition, the condition that 9 + (p = n/2 is satisfied, then an upper 
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bound on E [Ri] + E [R2] can be maximized. This results in the choice d = ix jl and = (that 
is, |«| = 1 and I7I = 0) and with this choice, we have 



It is also straightforward to check that the choice of wi and W2 as in the statement of the 
theorem meets this upper bound and is thus optimal. 

Case ii): In the second case where xi > X2 > 1j we start as in Case i) and after optimization 
over {6',}, we can bound the sum-rate as 

2E + 2E [i?2] +41og(2) 



^ ^ I 2VH-sin(g + 0) ■ sin(^ + 



(Xi-l)(sin^W + sin^(0)) + 2; ^ V(X2 - l)(sin^W + sin^(0)) + 2 
/ VH;sin(^+0) V . , ( 27x^ ■ sin(g + 0) ^ 



2yxr-sin(^ + 0) \ , / 2yx^-sin(^ + 



(Xi - l)(sin^(e) + sin^(0)) + 2; " V(X2 - l)(sin^(^^) + sin^(0)) 



(Xi-l)(sin^W + sin^(0)) + 2 ^ /(X2 - l)(sin2(^) + sin2(0)) + 2 

+ 2 log I 7^ ,^ 2/ , . ) + 2 log ' 



(Xi-l)sin^(0) + l ) "V (X2-I)sin^(^) + 1 

(204) 

The joint dependence between 6 and cp in the right-hand side of (12041) precludes the possibility 
of breaking down the double variable optimization of (12041 ) into a pair of single variable 
optimizations. That is, the technique from Case i) fails here and this case needs to be studied 
differently. 

The proof in this case follows in three steps. In the first step, when xi > X2, we show that 
£2 (defined as below) is maximized by 6^ = 7r/2 and = 0: 

^ ^ (Xi-l)(sin^W + sin^(0)) + 2 (x2 - l)(sin^ W + sin^(0)) + 2 

(Xi-l)sin2(0) + l ■ (x2-l)sin2W + l ^ ^ 

_ (Xi - 1)(X2 - 1) [sm\9) + sin^(0)]' + 2 (xi + X2 - 2) [sin^(^) + sin^(0)] + 4 
[(Xi - 1) sin2(0) + 1] [(X2 - 1) sm'{9) + l] 
For this, we set sin^(0) + sin^(0) to take a specific value of a. Since the numerator is only a 
function of a, maximizing £2 is equivalent to minimizing the denominator of (|206l ). There are 
two possible cases depending on whether a;<lorl<Q;<2. In the former case, it can be seen 
that the denominator is minimized by = and 6 = sm^^{^/a), whereas in the latter case, it 
is minimized by = sm~^ {^/Q'^^) and 9 = n/2. Substituting these values, it can be seen that 



(Xi-l)(X2-l) «^+2a(xi+X2-2)+4 if q; < 1 

Co < { l+(X2-l)a - C207) 

- ^ (Xi-l)(X2-l) a^+2a(xi+X2-2)+4 jf X < q, < 2 
X2-[l+(xi-l) — 
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A straightforward derivative calculation (using the critical fact that xi > X2) shows that while 
the right-hand side of (12071 ) is increasing for a < 1, it is decreasing for 1 < a < 2. In other 
words, 

< + + (208) 

and this upper bound is achieved with 9 = n /2, (p = 0. 
In the second step, if^^ + 0>7r/2, we have 

sm(^ + 0) ^sHe+^^^^ ^^^^2 (209) 



(Xi-l)(sin2(0) + sin2(0))+2 - Xi + ^ ~ X^ + ^ 
since sin(6') > sin(7r/2 — (p) = cos(0). Therefore, 

2E [Ri] + 2E [i?2] +41og(2) 



- ^ ^ (Xi - l)(sin2(^) + sin'm + 2J^^[{X2- l)(sin2(^) + sin^l^)) + 2)^ 

(210) 

<9[^]+9(^]+2logiC,), (211) 



xi + iy VX2 + 1 

2E [R,] + 2E [R2] < f (^^) + f (^^) + log ] , (212) 



,Xi + l/ VX2 + I/ \X2, 
where the second inequality follows from the monotonicity of g(») and the third from Step 1. 

Note that (12091 ) fails if 9 + (j) = v < n/2. Thus, in the third step, we consider this possibility. 

Here, a straightforward manipulation shows that 

sm^{e) + sin^iu ~ 0) = sm^{iy) - 2 sin(^) sin(z/ - 6) cos{u) < sm^{u), (213) 

where the last inequality follows because u < n/2. Hence, we have 

/ X / 2,/Y7sin(z/) \ / 2./Y^sin(z/) 
2E [R,] + 2E [R2] + A\og{2) < gl V^i l ; VA2 i; 



iXi - 1) sm\u) + 2j " V(X2 - 1) sm\u) + 2 
, [(xi-l)sinV)+2].[(x2-l)sin» + 2] \ , 
+ ' [ix, - 1) sin2(z. -e) + l]. [ix2 - 1) sin^(^) + l] ) ^''^^ 
It can be easily seen that £3 is maximized by = 7r/2 and = 0. The upper bound is the same 
in both the cases 9 + (p < n/2 and 9 + (p > 71/2. And this upper bound is met by the choice 
9 = Tx /2 and = (that is, |a| = 1 and I7I = 0) and is hence optimal. 

Case Hi): Since the expression for the sum-rate is symmetric in xi and X2, an argument analogous 
to Case ii) completes the theorem in the case xi ^ X2- 

The optimal sum-rate in all the three cases is given by the unified expression 



2E m + 2E [R2] f { ) + / ( ) + log (xi) - log (X2) 

\Xi + 1/ VX2 + 1 ' 



(215) 



where /(•) is as defined in (11261) . This expression can be simplified as in the statement of the 
theorem. ■ 
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F. Comparison of Proof Techniques of Theorems |2] and \3\ 

We first show that Theorem |2]reduces to Theorem |3] under the assumption that the eigenvectors 
of Si and S2 coincide. For this, we set 



7 



5' 



(216) 



where X and Y are as in (|152l) and (|153l ). respectively. Note that the above transformation is a 
bijection from the space ja, /3, 7, 5 : |ap + = 1 = I7P + to the space 7', 5' : 

+ = 1 = |7'P + With this transformation, it can be checked that (11561 ) reduces 

to (11951 ). It can also be seen that the sum-rate expression in (|45l) reduces to that in (|66] ) in both 
cases. 

The technique pursued in the general case diverges from that in Appendix |E] in two ways. 
Difference 1: It can be easily checked that rs = if and only if the set of eigenvectors of Si 
and S2 coincide. In general, ra 7^ and arg(r3) could affect the sum-rate optimization. The 
first step in Appendix |D] is to show that this is not the case and arg(r3) plays no role in the 
optimization. This is done by exploiting the fact that the sum-rate optimization (see ([14])) is a 
problem over Q(2,l) and not over St (2, 1). 

Difference 2: The second complication is that there is a definitive (and easily classifiable) 
comparative relationship between ti = vf^Sg^vi and T2 = v|^S^^V2 in the special case. This 
comparative relationship does not generalize to the setting where the eigenvectors of Si and S2 
are different. 

Specifically, under Case i) of the discussion in Theorem IH ri > r2 if and only if X2 < 1 
whereas under Case ii), ti > T2 if and only if X2 > 1- On the other hand, in the general case, 
all the three possibilities: i) ri > r2, ii) ri < r2, iii) n = T2 can occur for appropriate choices 
of Si and S2. We now illustrate this with a numerical example. Let S2 be fixed such that 



1 

-0.6897 



-0.6897 
1 



(217) 



With the choice 



Si 



it can be seen that rji = 5.8, 772 



1 0.8 
0.8 1 
0.1184, Ti = 3.2222 and T2 

1 -0. 
0.8 1 



(218) 



0.5918, whereas if 



Si 



(219) 
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it can be seen that r]i = 1.0653, r^a = 0.6444, ri = 0.5918 and = 3.2222. It can be seen that 
r/i = 1.4603,7/2 = 0.5397 and n = T2 = 1.90725 if Si satisfies 



Si 



-0.34485 



(220) 



-0.34485 i 

These differences imply that, in general, there exists no bijective transformation (as in (|216l )) 
to transform the objective function from the form in (|1561 l to that in (11951) . Despite these issues, 
it would be of interest to pursue a theme that could unify the general case with the special case. 



G. Proof of PropM 

The proof in the low-SNR extreme is obvious. In the high-SNR extreme, we first note that the 
optimization problem over the choice of a pair (wi, vi^2) that results in a corresponding choice 
of {Ai, Bi, Ci) can be recast in the form of a two parameter optimization problem over (Mj, A^j) 
with Mj = 4^ and Ni = under the constraint that 

0<N^ <Mi<x^ = ^4i4- (221) 
For this, observe that for any given choice of (wi, W2), the resultant {Ai,Bi,Ci) has to satisfy 

A, 



Cf < AiBi (Cauchy-Schwarz inequality) and 4^ < Xi (Ritz-Raleigh ratio) [47]. Thus, we have 



max hm E [Rj\ < max hm E [Ri] . (222) 

wi , W2 p->oo 0<Nf < Mi <XiP^°° 

Since the high-SNR expression for E [Ri\ satisfies 

(2 / M A'^'^ \ 
' j + 2 log (1 + M,) , (223) 

and g{») is an increasing function, optimization over Ni which affects only the first term on the 
right-hand side implies that the optimal choice of Ni is zero. Plugging this choice and using the 
structure of g{») and /(•) from (|125l ) and (|126l ) respectively, we have 

Em<j^^^^-\\og{Mi)\. (224) 

The right-hand side of (12241) is increasing in Mj since the derivative function satisfies 



^ Af,-|log(A/.)| 
" \M,-1\ 

dMi 



M,-l-log(M,) .J- . 



< 



\ if Mi = 1 (225) 

^"'^ if M < 1, 



where all the three pieces are positive and hence, the derivative function is smooth. The goal of 
maximizing Mj under the constraints of A'j = and unit-normedness of wi and W2 is met by 
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the choice as in the statement of the proposition. This upper bound to E [Ri] is also met by the 
same choice of beamforming vectors and this choice is thus optimal. 

Recall from (|124l) (the high-SNR expression) that E[Ri] is the sum of two terms. The 
increasing nature of g{») means that the first term of ()124l) is maximized when (isi(wi,W2) 
is maximized. That is, by the choice {wi, W2} as in (|134I) - (I135I) . On the other hand, the second 
term as well as E [Ri] (which is the sum of the two terms) are maximized by the choice in (|85l) . 
With this choice of beamforming vectors, (is, (■, ■) can be written as 
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